Boosting the Potential of Large Language Models with an Intelligent Information Assistant
2025.03.22
Authors:
- Yujia Zhou→ Tsinghua University, zhouyujia@mail.tsinghua.edu.cn
- Zheng Liu→ The Hong Kong Polytechnic University, zhengliu1026@gmail.com
- Zhicheng Dou→ Renmin University of China, dou@ruc.edu.cn
🎉 Accepted by NeurIPS 2024
🎯 Introduction
With the rise of Large Language Models (LLMs), significant advances have been made in Natural Language Processing (NLP). However, these models often generate incorrect information, a phenomenon known as "hallucination." Traditional Retrieval-Augmented Generation (RAG) methods fall short on complex reasoning tasks, particularly those that require multiple steps.
Motivation
The motivation for developing the ASSISTRAG framework includes:
- LLM Hallucination: LLMs frequently produce inaccurate information.
- Inadequate RAG Performance: RAG methods struggle with complex and multi-step reasoning tasks.
- Impact of Prompting and Fine-Tuning: Techniques like prompting and fine-tuning can degrade the original capabilities of LLMs.
- Need for Frequent Retraining: Maintaining performance requires timely retraining of the models.
🔬 ASSISTRAG Framework
2.1 Main Components
The ASSISTRAG framework consists of two primary components:
- Main LLM: Static, responsible for generating answers.
- Assistant LLM: Trainable, responsible for information management.
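The division of labor between the two components can be sketched as follows. Both classes are rule-based stand-ins invented for illustration, not the paper's actual models: the assistant gathers context (memory plus retrieved documents) and the static main LLM generates from it.

```python
# Minimal sketch of the two-component ASSISTRAG setup.
# Both "models" below are keyword-matching stand-ins for real LLMs.

class AssistantLLM:
    """Trainable assistant: manages memory and retrieved knowledge."""
    def __init__(self):
        self.memory = []  # past question/insight records

    def prepare_context(self, question, knowledge_base):
        # Gather notes and documents that share words with the question.
        q_words = question.lower().split()
        notes = [m for m in self.memory if any(w in m for w in q_words)]
        docs = [d for d in knowledge_base if any(w in d.lower() for w in q_words)]
        return {"question": question, "notes": notes, "documents": docs}

class MainLLM:
    """Static main model: answers from the context the assistant prepared."""
    def answer(self, context):
        evidence = context["notes"] + context["documents"]
        source = evidence[0] if evidence else "no supporting evidence found"
        return f"Answer to '{context['question']}' based on: {source}"

assistant = AssistantLLM()
main = MainLLM()
kb = ["Paris is the capital of France."]
ctx = assistant.prepare_context("What is the capital of France?", kb)
result = main.answer(ctx)
```

The key design point is that `MainLLM` stays frozen; only the assistant's context-preparation behavior would be trained.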
2.2 Core Functions
ASSISTRAG organizes its core capabilities into three groups:
Memory Management
- Retrieving Relevant Information: Retrieve relevant information from the system’s memory based on the current question.
- Evaluating Relevance: Determine if the retrieved information is relevant to answering the current question.
- Storing New Insights: If the main LLM generates new insights that are not already stored in the system’s memory, record them for future use.
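The three memory-management steps above can be sketched as follows; word overlap here is an invented stand-in for the assistant's learned relevance judgment:

```python
# Sketch of the three memory-management steps, using keyword overlap
# as a toy substitute for an LLM's relevance judgment.

def retrieve(memory, question, min_overlap=2):
    """Step 1: retrieve memory entries sharing words with the question."""
    q_words = set(question.lower().split())
    return [m for m in memory if len(q_words & set(m.lower().split())) >= min_overlap]

def is_relevant(entry, question, min_overlap=2):
    """Step 2: judge whether a retrieved entry helps answer the question."""
    return len(set(entry.lower().split()) & set(question.lower().split())) >= min_overlap

def store_insight(memory, insight):
    """Step 3: record a new insight only if it is not already stored."""
    if insight not in memory:
        memory.append(insight)
    return memory

memory = ["the capital of france is paris"]
question = "what is the capital of france"
hits = retrieve(memory, question)
store_insight(memory, "france is in western europe")
store_insight(memory, "france is in western europe")  # duplicate, ignored
```

The deduplication check in `store_insight` mirrors the condition that only insights not already in memory get recorded.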
Knowledge Management
- Question Decomposition: Break down the question into multiple sub-queries.
- Knowledge Retrieval: Retrieve relevant documents from an external knowledge base to support the sub-queries.
- Knowledge Extraction: Extract the necessary knowledge from the retrieved documents to answer the original question.
- Evaluating Relevance: Determine if the extracted knowledge should be included in the response generation process.
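The four knowledge-management steps chain together into a pipeline, sketched below. Every component is a toy stand-in: the paper's assistant performs each step with a trained LLM and a real retrieval index, whereas this sketch uses keyword containment and a hard-coded `" and "` decomposition rule.

```python
# Toy pipeline for the four knowledge-management steps:
# decompose -> retrieve -> extract -> evaluate relevance.

STOPWORDS = {"the", "is", "of", "a", "what", "who", "and", "in"}

def keywords(text):
    """Lowercased content words of a string, punctuation stripped."""
    return {w.strip("?.,") for w in text.lower().split()} - STOPWORDS

def decompose(question):
    """Question decomposition: split an 'and'-joined question into sub-queries."""
    return [part.strip() for part in question.rstrip("?").split(" and ")]

def retrieve_docs(kb, sub_query):
    """Knowledge retrieval: keep documents covering all sub-query keywords."""
    return [d for d in kb if keywords(sub_query) <= keywords(d)]

def extract(docs, sub_query):
    """Knowledge extraction: keep sentences covering all sub-query keywords."""
    return [s for d in docs for s in d.split(". ") if keywords(sub_query) <= keywords(s)]

def is_useful(snippet, question):
    """Evaluating relevance: include a snippet only if it overlaps the question."""
    return bool(keywords(snippet) & keywords(question))

kb = [
    "Paris is the capital of France. France uses the euro.",
    "Berlin is the capital of Germany. Germany uses the euro.",
]
question = "What is the capital of France and what is the capital of Germany?"
evidence = []
for sub in decompose(question):
    for snippet in extract(retrieve_docs(kb, sub), sub):
        if is_useful(snippet, question) and snippet not in evidence:
            evidence.append(snippet)
```

Note how the final relevance check filters against the original question, not just the sub-query, matching the step above.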
Other Core Functions
- Tool Usage: Retrieving information from both internal memory and external knowledge bases.
- Action Execution: Processing, analyzing, and extracting information.
- Plan Specification: Determining the necessity of each step in the process.
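As a toy illustration of plan specification, the sketch below maps surface cues in a question to the steps it needs. The trigger rules are invented for illustration; the paper's assistant makes these decisions with a trained LLM.

```python
# Toy plan specification: decide which processing steps a question needs.
# The heuristics are hypothetical; a real assistant would decide with an LLM.

def specify_plan(question):
    plan = []
    lowered = question.lower()
    if "earlier" in lowered or "before" in lowered:
        plan.append("memory_retrieval")        # past interactions may help
    if "?" in question:
        plan.append("knowledge_retrieval")     # external evidence needed
    if " and " in lowered or "compare" in lowered:
        plan.append("question_decomposition")  # multi-part question
    plan.append("answer_generation")           # always generate at the end
    return plan

plan = specify_plan("Compare the populations of France and Germany?")
```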
📚 Training Methodology
ASSISTRAG employs a three-phase training approach:
1. Foundational Curriculum Learning:
   - Establish the assistant's basic capabilities.
   - Gradually increase the complexity of training tasks.
   - Ensure the assistant can handle a wide range of queries.
2. Specialized Curriculum Learning:
   - Focus on enhancing the assistant's abilities in specific domains or reasoning skills.
   - Introduce more challenging tasks and scenarios.
   - Refine the assistant's knowledge and problem-solving strategies.
3. Reinforced Preference Optimization:
   - Adjust the assistant's output based on feedback from the main LLM.
   - Optimize the assistant's responses to align with the main LLM's preferences and desired outputs.
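One plausible way to turn the main LLM's feedback into training signal for the third phase is to build preference pairs: two candidate assistant outputs are scored by whether the main LLM succeeds when given each one, and the better one is marked "chosen". The sketch below is an assumption about the data format, not the paper's exact procedure, and `score_with_main_llm` is a hypothetical stand-in for real feedback.

```python
# Hypothetical sketch: building a DPO-style preference pair from
# main-LLM feedback. score_with_main_llm stands in for running the
# main LLM and checking its answer against a gold reference.

def score_with_main_llm(note, gold_answer):
    """Stand-in feedback signal: 1.0 if the note contains the gold answer."""
    return 1.0 if gold_answer.lower() in note.lower() else 0.0

def build_preference_pair(question, note_a, note_b, gold_answer):
    """Return a preference record, or None when neither note is better."""
    sa = score_with_main_llm(note_a, gold_answer)
    sb = score_with_main_llm(note_b, gold_answer)
    if sa == sb:
        return None  # no learning signal from a tie
    chosen, rejected = (note_a, note_b) if sa > sb else (note_b, note_a)
    return {"prompt": question, "chosen": chosen, "rejected": rejected}

pair = build_preference_pair(
    "What is the capital of France?",
    "The capital of France is Paris.",
    "France is a country in Europe.",
    gold_answer="Paris",
)
```

Records in this shape could then feed a standard preference-optimization trainer; ties are dropped because they carry no signal about which output the main LLM prefers.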
📈 Experimental Setup and Results
4.1 Experimental Setup
Experiments were conducted using multiple complex question-answering datasets to evaluate the performance of ASSISTRAG.
4.2 Experimental Results
- ASSISTRAG demonstrated superior performance across various foundational LLMs, particularly providing significant benefits to weaker models.
- The framework exhibited advantages in accuracy, efficiency, and cost-effectiveness.
🏁 Conclusion
The proposed ASSISTRAG effectively enhances LLMs’ performance in complex reasoning tasks. Future work will focus on expanding the assistant’s capabilities, including long-text processing and personalized support.
Comments:
- This post summarizes a shared paper; I am not one of its authors.